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THE  VALIDITY  OF  PERFORMANCE  RATINGS  ADJUSTED  FOR  ENVIRONMENTAL  CONDITIONS 


BRIEF 


Requirement: 

Maintaining  Army  readiness  requires  valid  and  reliable  assessments  of 
soldier  performance.  This  report  investigates  a  method  to  increase  the 
accuracy  of  performance  evaluations  by  adjusting  for  the  effects  of  envir¬ 
onmental  factors. 


Procedure: 

Soldiers  were  rated  on  various  Soldiers'  Manual  tasks  by  officer  eval¬ 
uators  during  an  extended  field  training  exercise.  These  "raw"  score  ratings 
were  adjusted  to  take  into  account  the  effects  of  seven  different  environmental 
conditions  (e.g.,  climate,  visibility,  etc.).  The  unadjusted  and  adjusted 
ratings  were  then  compared  with  supervisors'  daily  performance  ratings  to 
determine  if  the  adjustments  increased  the  accuracy  of  the  officers'  evalua¬ 
tions.  Before  the  hypothesis  that  ratings  which  were  adjusted  for  environ¬ 
mental  conditions  were  more  accurate  than  unadjusted  ratings  could  be  tested, 
it  was  necessary  to  determine  if  the  evaluating  officers  and  supervisors 
were  measuring  the  same  construct.  A  significant  correlation  between  the 
raters'  scores  would  indicate  they  were. 


Findings: 

Performance  ratings  of  females  by  the  two  types  of  raters  were  signifi¬ 
cantly  correlated.  Hence,  the  hypothesis  of  increasing  the  validity  (accuracy) 
of  the  ratings  by  adjusting  them  could  be  tested.  It  was  found  that  the  corre¬ 
lation  of  the  adjusted  ratings  with  supervisors'  ratings  was  higher  than  the 
correlation  of  unadjusted  ratings  with  supervisors'  ratings.  This  difference 
between  the  two  correlations  approached  statistical  significance.  In  addition, 
adjusting  the  ratings  made  them  significantly  more  reliable  than  the  unadjusted 
ratings.  Performance  ratings  of  the  males  by  the  two  types  of  raters  were  not 
significantly  correlated,  which  precluded  further  analysis  for  this  group. 


Utilization  of  Findings 

The  technique  described  in  this  report  shows  promise  for  increasing  the 
reliability  and  validity  of  performance  ratings  made  in  situations  in  which 
environmental  conditions  may  affect  performance  ratings.  More  research  is 
needed  to  develop  the  technique. 
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THE  VALIDITY  OF  PERFORMANCE  RATINGS, 
ADJUSTED  FOR  ENVIRONMENTAL  CONDITIONS* 


INTRODUCTION 


Background 

Rating  scales  have  long  been  used  to  assess  performance.  When  the  perform¬ 
ance  being  measured  takes  place  in  a  location  where  conditions  may  vary  widely, 
valid  performance  ratings  become  problematic.  A  soldier's  performance  on  a 
task  in  an  Army  field  environment,  for  example,  may  be  viewed  as  due  to  his/ 
her  personal  characteristics  (such  as  ability,  achievement,  and  motivation) 
plus  an  effect  of  the  environmental  conditions  which  surround  the  measurement. 

Two  recent  research  efforts  by  the  Army  Research  Institute  have  focused  on  per¬ 
formance  during  field  training  exercises.  The  first  of  these,  MAX  WAC  (Army 
Research  Institute,  1977),  was  concerned  with  the  effect  of  the  proportion  of 
women  on  the  accomplishment  of  a  unit's  mission  as  measured  by  scores  on  ARTEP 
(Army  Training  and  Evaluation  Program)  tasks.  REF  WAC,  the  second  research 
effort  (Johnson,  Cory,  Day,  &  Oliver,  1978),  was  primarily  concerned  with  the 
comparative  performance  of  men  and  women.  In  the  earlier  MAX  WAC  research, 

ARTEP  performance  measures  were  relatively  imprecise  for  the  MAX  WAC  Army 
officer  evaluator  used  only  a  3-point  scale  Cl  “  unsatisfactory;  2  »  satis¬ 
factory;  3  ■  outstanding)  to  assess  performance  on  ARTEP  tasks  such  as  erecting 
a  hospital  tent  or  serving  as  a  convoy  during  night  maneuvers.  The  evaluators 
used  a  very  subjective  procedure  in  arriving  at  their  final  ratings.  For  example, 
in  evaluating  the  performance  of  a  transportation  unit  moving  supplies  in  a  rain¬ 
storm,  the  evaluator  might  take  into  account  the  inclement  weather  in  rating  the 
quality  of  the  unit's  performance. 

The  REF  WAC  research  built  upon  the  MAX  WAC  experience  and  attempted  to  make 
more  precise  and  objective  the  measures  used  in  assessing  individual  performance 
on  Soldier' 8  Manual  tasks  such  as  investigating  a  traffic  accident.  For  each 
individual  event  (task)  to  be  evaluated,  a  set  of  criteria  was  constructed  (e.g., 
"identifies  and  secures  statements  from  drivers  and  witnesses") .  Each  criterion 
was  rated  as  satisfactory  or  unsatisfactory,  and  a  final  rating  was  made  on  a 
7-point  scale.  In  addition,  ratings  of  the  effects  of  seven  environmental  condi¬ 
tions  (such  as  weather  or  leadership)  were  averaged  to  obtain  an  environment 
score.  The  raw  ("unadjusted")  score  and  the  environment  score  were  summed  to 
obtain  an  "adjusted"  score.  Thus,  both  the  MAX  WAC  and  REF  WAC  research 
attempted  to  correct  for  error  caused  by  environmental  conditions  surrounding  the 
measurement.  The  REF  WAC  measures,  profiting  from  the  MAX  WAC  research,  were 
the  more  refined  and  objective  measures. 


A  version  of  this  paper  was  presented  at  the  meeting  of  the  American  Psychological 
Association,  Montreal,  September  1980.  The  authors  wish  to  extend  their  appre¬ 
ciation  to  Dr.  John  J.  Mel linger  and  Mr.  Sidney  A.  Sachs  for  their  assistance  with 
the  statistical  analysis. and  to  Dr.  Melvin  J.  Kimmel  for  his  helpful  comments. 
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Plus,  of  course,  measurement  error.  This  paper  addresses  the  problem  of  non- 
random  bias  caused  by  environmental  conditions  rather  than  traditional  measurement 
error. 


1 


Although  the  REF  WAC  measures  Improved  upon  those  developed  for  the  earlier 
MAX  VAC  research.  It  was  still  not  known  to  what  extent  the  adjusted  ratings 
used  in  REF  WAC  reduced  error  and  thus  more  closely  represented  the  individual's 
true  level  of  performance.  The  purpose  of  the  present  research  was  to  investi¬ 
gate  the  validity  of  the  adjusted  ratings.  If  the  adjusted  ratings  were  a  more 
accurate  assessment  of  an  individual's  performance,  the  adjusted  ratings  should 
correspond  more  closely  to  another  measure  of  individual  performance  (daily 
performance  ratings  by  the  soldiers'  supervisors)  than  would  the  unadjusted  or 
raw  score  ratings.  Accordingly,  the  following  hypothesis  was  tested  for  males 
and  for  females: 

The  correlation  of  adjusted  individual  event  ratings  with 
supervisors'  dally  performance  ratings  is  significantly 
higher  than  is  the  correlation  of  unadjusted  individual 
event  ratings  with  supervisors'  daily  performance  ratings. 

METHOD 

The  Instruments,  subjects,  and  procedures  used  in  the  REF  WAC  research 
project  are  described  in  detail  in  Johnson  et  al.,  1978.  Brief  descriptions 
of  the  methodological  aspects  which  pertain  specifically  to  the  research  re¬ 
ported  here  are  given  below. 


Subjects 


The  sample  for  this  research  consisted  of  151  enlisted  personnel  (97  women 
and  54  men)  in  22  Army  companies  participating  in  an  extended  (11-day)  field 
training  exercise  in  Europe.  The  sample  constituted  a  subset  of  a  larger  group 
of  male  and  female  service  members  who  were  the  subjects  of  the  REF  WAC  research 
described  in  Johnson  et  al.  (1978) .  The  larger  group  consisted  of  a  female 
cohort  (N  ■  200) ,  which  included  all  women  in  the  22  companies  and  a  male  cohort 
(N  -  196)  containing  men  who  had  been  matched  with  the  women  on  the  basis  of 
company,  paygrade,  length  of  service,  military  occupational  specialty  (Army 
job),  and  intelligence.  Due  to  transfers  of  personnel  to  other  units  and  to 
the  fact  that  members  of  the  male  cohort  were  frequently  not  available  for  per¬ 
formance  ratings  on  individual  tasks,  complete  data  were  available  for  only  97 
women  and  54  men. 

Instruments 

Individual  Performance  Score  Sheet  (Individual  Event  Rating  Module) .  This 
Instrument  was  used  by  officer  evaluators  to  rate  individuals  on  individual 
events,  which  were  specific  Soldier's  Manual  tasks.  The  tasks  selected  were 
those  which  the  researchers  felt  would  be  typical  of  a  given  type  of  unit  and 
which  were  expected  to  occur  with  some  frequency  during  the  field  training 
exercise  (such  as  "Investigate  Traffic  Accidents”  for  Military  Police  units). 

The  rating  form  contained  scoring  criteria  that  were  equivalent,  where  possible, 
to  those  provided  by  the  Training  and  Doctrine  Command  (TRADOC)  for  the  Soldier' 
Manual  task.  Criteria  for  the  task  were  enumerated  on  the  form  (e.g.,  "Measures 
and  records  skidmarks  and  final  position  of  vehicles"),  and  the  evaluator  was 


instructed  to  score  each  criterion  as  satisfactory,  unsatisfactory,  or  not 
applicable.  After  considering  the  scores  on  all  criteria,  the  evaluator  then 
produced  a  performance  rating  on  a  7-point  scale.  This  scale  incorporated  key 
words  and  phrases  from  the  operational  Enlisted  Evaluation  Report  (DA  Form  2155-55) , 
ranging  from  "Performed  all  tasks  in  a  superior  manner..."  (rated  7)  to  "Performed 
all  tasks  in  an  inferior  manner..."  (rated  1). 

In  addition  to  the  performance  rating  an  "environment  score"  was  also  recorded. 
The  environment  score  was  the  evaluator's  estimate  of  the  degree  to  which  environ¬ 
mental  conditions  had  affected  the  individual's  performance.  Each  of  seven  envi¬ 
ronmental  conditions  (weather,  terrain,  equipment,  night,  leadership,  instructions, 
and  activity  level)  was  rated  on  a  three-point  scale.  The  three  points  of  this 
scale  were  identified  as:  optimal  conditions  (0),  mildly  adverse  conditions  (1), 
and  severely  adverse  conditions  (2).  After  considering  his/her  ratings  of  the 
seven  environmental  conditions,  the  evaluator  recorded  an  overall  environment 
score  which  reflected  his/her  estimate  of  the  combined  effect  of  the  seven  con¬ 
ditions  on  a  7-point  scale  ranging  from  0  to  3  with  half-point  increments.  The 
two  scores  just  described  (the  raw  score  performance  rating  and  the  environment 
score)  were  then  summed.  Hence,  the  "unadjusted"  score  was  the  raw  score  per¬ 
formance  rating,  and  the  "adjusted"  score  was  the  raw  score  corrected  for  envi¬ 
ronmental  conditions.  (See  Appendix  A,  page  A-l,  for  an  example  of  an  individual 
event  rating  module.)  The  correlation  of  the  unadjusted  ratings  and  the  adjusted 
ratings,  based  on  151  subjects  (97  males  and  54  females),  was  .90. 

The  reliability  of  the  individual  event  ratings  was  also  investigated.  Two 
or  more  scores  were  available  for  175  subjects  (89  males  and  97  females),  for 
whom  the  mean  number  of  ratings  was  4.2.  Since  many  subjects  had  more  than  two 
ratings  and  there  were  unequal  numbers  of  ratings  per  subject,  the  intraclass 
correlation  (Ebel,  1967)  was  used.  The  intraclass  correlation  was  .44  for  the 
unadjusted  ratings  and  .57  for  the  adjusted  ratings.  A  comparison  of  these 
intraclass  correlations  (Fisher,  1950)  revealed  that  they  differed  significantly 
(£<.05),  thus  demonstrating  that  adjusting  the  performance  ratings  significantly 
increased  the  reliability  of  the  ratings. 

Daily  Record  of  Work  Availability  and  Performance  (Schedule  4) .  This  form 
was  used  by  non-commissioned  officer  (NCO)  data  collectors  to  record  supervisors' 
ratings  of  the  performance  of  the  male  and  female  subjects.  These  daily  perfor¬ 
mance  ratings  were  on  the  same  7-point  scale  that  was  used  for  the  individual 
event  ratings  described  above  and  were  measures  of  the  subject's  overall  per¬ 
formance  on  the  preceding  day.  The  ratings  were  collected  daily  from  the  indi¬ 
vidual's  supervisor. 

To  assess  the  reliability  of  the  supervisors'  daily  performance  ratings, 
a  split-half  procedure  was  used.  Daily  ratings  were  available  for  278  sub¬ 
jects  (141  from  the  male  cohort,  and  137  from  the  female  cohort).  As  the  corre¬ 
lation  between  the  odd-day  and  the  even-day  ratings  was  .93  for  this  group,  one 
can  conclude  that  these  ratings  were  highly  reliable. 

Data  Collection 

Individual  event  ratings.  The  performance  of  individuals  on  the  Soldier's 
Manual  tasks  was  rated  by  the  officer  evaluators,  who  used  the  Individual  Per¬ 
formance  Score  Sheet  described  above.  An  effort  was  made  to  observe  each  of 


the  female  soldiers  on  a  variety  of  tasks.  However,  the  evaluators  had  no 
control  over  the  type  of  tasks  performed  or  the  frequency  of  task  occurrence. 
After  a  woman  was  observed  and  rated,  a  man  from  the  same  company  and  as  similar 
to  her  as  possible  in  age  and  rank  was  rated  on  the  same  task. 

Supervisors'  daily  performance  ratings.  The  Schedule  4  (Daily  Record  of 
Work  Availability  and  Performance)  data  were  collected  by  noncommissioned  offi¬ 
cers  (NCOs)  assigned  to  each  company.  Each  day  during  the  field  training  exer¬ 
cise,  the  NCO  data  collectors  obtained  performance  ratings  for  each  male  and 
female  subject  in  their  assigned  units.  These  ratings  were  of  the  subject's 
overall  performance  on  the  preceding  day. 

Data  Analysis 

Before  a  meaningful  test  could  be  made  of  the  hypothesis,  it  was  necessary 
to  determine  if  the  two  types  of  performance  ratings  (individual  event  ratings 
and  supervisors'  daily  performance  ratings)  did  in  fact  measure  the  same  con¬ 
struct — namely,  an  individual's  job  performance  during  the  field  training  exer¬ 
cise.  Accordingly,  the  supervisors'  daily  ratings  were  correlated  with  (1)  the 
unadjusted  individual  event  ratings  and  (2)  the  adjusted  individual  event  rat¬ 
ings  for  the  male  and  female  groups.  A  correlation  which  differed  significantly 
from  zero  provided  evidence  of  construct  validity — i.e.,  both  types  of  perfor¬ 
mance  measures  (Individual  event  ratings  and  supervisors'  daily  ratings)  appeared 
to  have  significant  variance  in  common.  In  such  a  case,  it  was  reasonable  to 
continue  the  analysis  and  test  the  hypothesis  by  assessing  the  significance  of 
the  difference  between  the  correlations  of  (1)  unadjusted  individual  ratings 
and  supervisors'  dally  ratings  and  (2)  adjusted  individual  ratings  and  super¬ 
visors'  dally  ratings  by  using  Hotelling's  £  test  for  the  significance  of  the 
difference  between  nonindependent  j:'s  (Edwards,  1963).  If  the  latter  correla¬ 
tion  should  be  higher  than  the  former,  the  findings  would  suggest  that  adjusting 
the  ratings  makes  them  more  accurate — i.e.,  increases  their  validity. 

Results 


The  correlation  of  unadjusted  individual  ratings  and  supervisors'  daily 
ratings  (.32)  and  the  correlation  of  adjusted  individual  ratings  and  super¬ 
visor's  daily  ratings  (.40)  were  both  significantly  different  from  zero  at  the 
.001  level  for  the  female  sample.  For  males,  however,  neither  the  correlation 
between  unadjusted  individual  task  ratings  and  supervisors'  daily  ratings  (.01) 
nor  the  correlation  between  adjusted  individual  task  ratings  and  supervisors' 
daily  ratings  (-.09)  varied  significantly  from  zero. 

In  view  of  these  findings,  the  hypothesis  was  tested  only  for  the  female 
group  using  a  two-tailed  _t  test  for  nonindependent  jr’s  (Edwards,  1963).  The 
_t  value  for  the  difference  between  the  correlation  of  unadjusted  individual 
event  ratings  and  daily  supervisors'  ratings  and  the  correlation  of  adjusted 
individual  event  ratings  and  daily  supervisors'  ratings  was  1.90.  This  finding 
approached  but  did  not  attain  significance  (t^  qj  ■  1.99,  d_f  ■  94). ^  Table  1 
summarizes  these  results. 


It  would  be  possible  to  argue  for  a  one-tailed  test,  although  the  authors 
chose  not  to  do  so.  The  difference  between  the  correlations  would  be  signif¬ 
icant  at  the  .05  level  if  a  one-tailed  test  were  used. 


Discussion 


The  limitations  of  the  data  collection  procedure  should  be  kept  in  mind 
in  considering  the  results  of  this  research.  The  noninterference  constraint, 
for  example,  limited  the  opportunities  for  observing  Individual  events  to  such 
an  extent  that  only  70  of  the  113  different  modules  prepared  were  actually 
used.  Observations  of  individual  events  were  thus  largely  a  matter  of  chance, 
with  a  low  probability  of  observing  the  paired  male.  In  addition,  mean  scores 
for  individuals  were  sometimes  based  upon  only  one  or  two  observations. 

However,  the  findings  show  that  the  procedure  for  adjusting  the  raw  score 
ratings  had  positive  consequences  for  both  the  reliability  and  validity  of  the 
ratings.  The  reliability,  for  example,  increased  significantly  (j>  .05)  as  a 

result  of  applying  the  correction  procedure. 

The  results  reported  here  also  provided  evidence  of  construct  validity 
for  the  two  types  of  performance  ratings  used  in  this  research.  Correlations 
were  significant  between  the  unadjusted  Individual  event  ratings  and  super¬ 
visors'  daily  ratings  and  between  the  adjusted  individual  event  ratings  and 
supervisors'  daily  ratings.  However,  this  finding  applied  only  to  the  female 
subjects.  These  correlations  were  not  significant  for  the  male  group. 

This  evidence  of  construct  validity  is  of  particular  Interest  because  the 
two  types  of  performance  ratings  were  very  different.  The  individual  event 
rating  dealt  with  a  specific  task,  while  the  supervisors'  ratings  were  global 
in  nature.  Further,  the  Individual  ratings  were  made  by  an  officer  observer 
not  connected  with  the  soldiers'  unit,  whereas  the  supervisors'  ratings  were 
made  by  someone  who  knew  the  soldier  well. 

The  findings  also  suggest  that  adjusting  raw  scores  for  environmental 
conditions  results  in  a  more  valid  measure  of  performance.  For  women,  the 
difference  between  the  correlation  of  the  adjusted  ratings  with  supervisors' 
ratings  and  the  correlations  of  the  unadjusted  ratings  with  those  ratings 
approached  significance.  Therefore,  it  appears  that  the  procedure  for  adjust¬ 
ing  the  raw  scores  for  environmental  conditions  had  positive  consequences  for 
the  validity  of  the  performance  ratings.  While  not  definitive,  these  findings 
are  viewed  as  encouraging  for  the  further  development  of  the  correction  pro¬ 
cedure. 

Since  the  positive  results  concerning  validity  were  found  only  for  the 
female  sample,  the  question  arises  as  to  why  this  outcome  occurred.  The  focus 
of  the  data  collection,  as  perceived  by  the  officers  and  NCOs  charged  with 
collecting  data,  was  on  female  performance.  It  appeared  that  officers  and 


NCOs  who  collected  data  made  a  special  effort  to  follow  the  specified  procedures 
for  women  and  were  somewhat  more  casual  about  the  collection  of  parallel  male 
data  because  they  believed  the  latter  to  be  of  lesser  importance. ^  Improved 
data  collection  procedures  would  minimize  such  problems,  and  perhaps  confirm 
the  hypotheses  for  male  as  well  as  female  subjects. 


4In  fact,  at  one  point  about  halfway  through  the  field  training  exercise,  it 
was  necessary  to  instruct  data  collectors  to  observe  more  all-male  groups  as 
most  of  the  group  event  data  being  collected  were  on  groups  containing  one  or 
more  women.  Although  the  group  event  ratings  are  not  considered  in  this  paper, 
the  incident  does  indicate  the  emphasis  the  data  collectors  placed  on  observing 
the  performance  of  women. 
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Table  1 


L< 


Correlations  of  Daily  Performance  Ratings  with  Unadjusted  and  Adjusted 
Individual  Event  Ratings  for  Males  and  Females 


Individual 

Event  Ratings 

Group 

Unadjusted 

Adjusted 

t  a 

Males  (n  “  54) 

.08 

-.01 

.40  (n.s.) 

Females  (n  ■  97) 

.32*** 

.40*** 

1.90  (n.s.) 

.t  test  for  the  significance  of  the  difference  between  nonindependent  _r's 
(two-tailed  test) 

***p<.001  that  correlation  differs  significantly  from  0. 
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INSTRUCTIONS 

(INDIVIDUAL  EVENT  RATING  MODULE) 

•EVENT  TITLE:  Descriptive  title  of  event 

•TOE  AUTliOHlZKD  JOS  TITLE:  As  extracted  from  appropriate  lino  in  authorization  document. 

•TEAM  CODE  SEQUENCE  NUMBER:  Includes  tcan  designation  and  nuucrical  team  sequence  nuc.bc r  at 

assigned  by  the  tcan  chief  e.g. ,  H? 01,  Tell,  SCI},  ItCO 3 ,  KTJ4, 

NAME  -  FEMALE:  None  and  rank  of  Subject. 

NAME  -  COHORT:  Same  as  above.  A  cohort  oust  be  tracked  with  each  female. 

UNIT:  Designation  of  company  to  include  battalion  headquarters  if  lettered  coopany. 

DTC:  Date  and  time  of  observation  in  six  numerical  digits  with  the  dace  flrat-e.g. 141305  Aug;; 

OBSERVATION  TIME  LENGTH:  The  ti»c  which  transpired  during  the  observation. 

TIME  SINCE  LAST  REST/WORK  BREAK:  The  amount  of  Cine  subject  has  been  on  duty  or  since 

shift  began. 

HOS :  Primary,  secondary  end  duty  MOS  in  that  order.  Do  not  interfere  with  unit.  Obtain 
froa-NCO  date  collector  and  fill  in  later. 

KOS  EXPERIENCE:  Tine  in  duty  MOS.  Obtain  Iron  KCO  data  collector. 

TIS:  Length  of  active  federal  pilitary  service.  Obtain  froa  NCO  data  collector. 

ENVIRONMENT:  Score  all  Iteas  as:  0  -  Optimal  conditions 

M  -  Mildly  adverse;  some  negative  effect 
S  -  Severely  adverse;  substantial  aejative  effect 
WA  -  Environment  not  applicable 

HEATHER:  Did  weather  conditions  (precipitation  etc.)  inhibit  individual  in  coaplctlng 
the  task! 

TERRAIN:  Did  the  topography  (vegetation,  road  network,  etc)  inhibit  Individual  In  conpleting 
the  task! 

EQUIPMENT:  Did  equipment  shortage,  unscrvlceabllity,  ete.  Inhibit  individual  in  completing 
the  task? 

MIGHT:  Did  natural  light  have  any  effect  on  individual  perforsance? 

LEADERSHIP:  If  supervision  is  applicable  to  event,  did  its  quality  affect  performance? 
INSTRUCTIONS:  Wes  individual  properly  briefed? 

ACTIVITY  LEVEL:  Has  individual  unduly  stressed  by  the  frequency  of  events  or  external 
factors?  Do  not  confuse  with  fatigue. 

•SCORIKC  CRITERIA:  Score  each  event/sub-cvent  as;  S  -  Satisfactory  B  -  Unsatisfactory 

NA  -  Not  applicable 

PERFORMANCE  SCORE:  (1-7): 

7  -  Performed  all  tasks  In  a  superior  Banner;  equivalent  to  the  perforsance  of  an 
outstanding  soldier. 

6  -  Performed  most  tasks  in  a  superior  aanner,  all  others  at  oinloua  standards; 

equivalent  te  the  perforaiancc  of  a  superior  soldier. 

S  -  Performed  sone  casks  in  a  superior  nanner,  all  others  at  niolwum  standards; 

equivalent  to  the  performance  of  an  excellent  soldier. 

4  -  Performed  all  tasks  st  minimum  standards;  equivalent  to  the  performance  of  an 
average  soldier. 

3  -  Performed  most  tasks  at  alniaua  standards;  seme  tasks  were  failed;  equivalent 
to  the  perforaance  of  a  marginal  soldier. 

2  -  Performed  s  few  tasks  at  aininus  standards  but  aost  were  failed;  equivalent  to 
the  perforaance  of  an  unsatisfactory  soldier. 

1  -  Performed  ell  tasks  so  inferior  ss  to  question  MOS  qualification  of  individual. 

ENVIRONMENT  SCORE  (0-3):  Average  letter  scores  froa  environmental  section  above  end  apply 

to  the  following  scale,  place  a  number  at  the  bottoa  of  csch 
column  in  space  provided: 

0  0.5  1-0  hi  ?Q  ?■>  3.0 

0  M  S 

OBSERVER:  Name  of  observer 


EVENT  SHEET  NUMBER:  Sheet  sequence  number  if  observer  completes  mute  than  one  sheet  for  ssae 
individual  event. 


Te  be  filled  In  in  advance. 


imC-_ 


PROJECT  RF.F  VAC 

INDIVIDUAL  PERFORMANCE  SCORE  SHEET 
(Thru*  O'nservacluns  Xiniivs) 
INDIVIDUAL  EVENT  RATING  KOOULE 


EVENT  TITLE  Investigate  Traffic  Arctdcncs 


TOE  AUTHORIZED  JOS  TITLE _ Military  Pollccaen 

TEAM  CODE  SEQUENCE  NUMBER  »'tU 


REFERENCE  DATA: 

Naoc-Fenala 

Kane-Cshort 

Unit 

DTC 

Observation  Tlaic  Length 

Tloe  Since  Last  Rest/L’ork  Break 

KOS 

MOS  Exserience  (Tine) 

T1S 

ENVIRONMENT:  • 

Vaatkar 

Terrain 

' 

Zquipaent 

NlShC 

Leadership 

Instructions 

Activity  Level 

SCORING  CRITERIA: 

X.  Assesses  situation  end  Insures 
continued  flow  of  traffic. 

2.  Renders  first  aid  and  assists 
with  the  evacuation  of  the  Injured. 

3.  Protects  the  accident  scene 
Midi  essential  facts  arc  gathered. 

4.  Kota  positions  of  fatalities 
sad  outline  la  chalk. 

3.  Aid  In  clearing  obstructions.  " 

••  -V V,  •  J*-.-. 

4.  Assists  vith  evacuation  of  ;  ; 

daaagod  vehicles.  '5/5 

;>V '  '  *'■  »  ? /»  >!♦>’■* 

2.  Requests  additional. KP  "" 

support,  (requests  eoergency 
vehicles  and  flee  and  aohulat.ee). 

8.  Secures  end  protects  evi¬ 
dence  by  cordoning  off  area. 

9.  Identifies  and  secures 
stateneuts  froo  drivers  and 
witnesses. 

10.  Ktasures  and  records  skld- 
oarVs  and  final  position  of 
vehicles. 

11.  Prepares  traffic  accident 
report  (DA  Pom  3946)  to  Include 
basic  sketch  of  scene. 

12.  Reestablished  nornsl  flov  of 
traffic  after  accident  scene  has 
been  cleared  of  vehicles  and  debris. 
PERFORMANCE  SCORE  (1-7): 

. 

-1;,.  . 

r* 

'  j  •*  * 

■  • 

. 

■ 

ENVIRONMENT  SCORE  (0-3): 

OBSERVER: 


EVENT  SHEET  HUM  TIER : 


